Abstract

Chang's lemma is a useful tool in additive combinatorics and the analysis of Boolean functions. Here we give an elementary proof using entropy. The constant we obtain is tight, and we
give a slight improvement in the case where the variables are highly biased.



1 The lemma 



For $S \in \{0,1\}^n$, let $\chi_S : \{\pm 1\}^n \to \mathbb{R}$ denote the character

\[ \chi_S(x) = \prod_{i \in S} x_i . \]

For any function $f : \{\pm 1\}^n \to \mathbb{R}$, we can then define its Fourier transform $\hat{f} : \{0,1\}^n \to \mathbb{R}$ as

\[ \hat{f}(S) = \mathbb{E}_x f(x) \chi_S(x) = \frac{1}{2^n} \sum_x f(x) \chi_S(x) . \]

For characters of Hamming weight 1, we will abuse notation by writing $\hat{f}(i)$ instead of $\hat{f}(\{i\})$.

Chang's lemma [1, 2] places an upper bound on the total Fourier weight, i.e., the sum of $\hat{f}^2$, of
the characteristic function of a small set on the characters with Hamming weight one.

Lemma 1. Let $A \subseteq \{\pm 1\}^n$ such that $|A| = 2^n \alpha$, and let $f = 1_A$ be its characteristic function. Then

\[ \sum_{i=1}^n \hat{f}(i)^2 \le 2 \alpha^2 \ln \frac{1}{\alpha} . \]

Proof. Suppose that we sample $x$ according to the uniform distribution on $A$. Since the mutual
information is nonnegative, the entropy $H(x)$ is at most the sum of the entropies of the individual
bits,

\[ H(x) \le \sum_{i=1}^n H(x_i) . \]
This gives

\[ n \ln 2 + \ln \alpha \le \sum_{i=1}^n h(p_i^+) , \tag{1} \]



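where $p_i^+$ denotes the probability that $x_i = +1$,

\[ p_i^+ = \frac{1}{2} \left( 1 + \frac{1}{|A|} \sum_{x \in A} x_i \right) = \frac{1}{2} \left( 1 + \frac{\hat{f}(i)}{\alpha} \right) , \]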
and where $h$ denotes the entropy function

\[ h(p) = -p \ln p - (1-p) \ln (1-p) . \]
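The Taylor series around $p = 1/2$ gives

\[ h\left( \frac{1+x}{2} \right) \le \ln 2 - \frac{x^2}{2} , \tag{2} \]

so (1) becomes

\[ \ln \alpha \le -\frac{1}{2\alpha^2} \sum_{i=1}^n \hat{f}(i)^2 . \]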

Rearranging completes the proof. □ 
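The inequality of Lemma 1 can be checked numerically on small examples. The following is a minimal sketch, not part of the original argument: the helper names `fourier_weight_one` and `chang_bound`, the dimension `n = 8`, and the randomly sampled set `A` are all illustrative assumptions.

```python
import itertools
import math
import random

def fourier_weight_one(A, n):
    """Return [f_hat(1), ..., f_hat(n)] for f = 1_A, with A a list of +/-1 tuples."""
    # f_hat(i) = 2^{-n} * sum over x in A of x_i
    return [sum(x[i] for x in A) / 2**n for i in range(n)]

def chang_bound(alpha):
    """Right-hand side of Lemma 1: 2 * alpha^2 * ln(1/alpha)."""
    return 2 * alpha**2 * math.log(1 / alpha)

n = 8                                   # illustrative dimension (assumption)
cube = list(itertools.product([1, -1], repeat=n))
rng = random.Random(0)                  # fixed seed so the run is reproducible
A = rng.sample(cube, 20)                # an arbitrary small set (assumption)
alpha = len(A) / 2**n

weight = sum(c**2 for c in fourier_weight_one(A, n))
print(weight, "<=", chang_bound(alpha))
```

Any other choice of `A` should satisfy the same inequality, since the sketch only evaluates both sides of the lemma.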



2 Variations 

The lemma (and our proof) apply equally well to the Fourier weight $\sum_{S \in B} \hat{f}(S)^2$ of any basis $B$ of
$\mathbb{F}_2^n$, since the set of parities $\{ \prod_{i \in S} x_i \mid S \in B \}$ determines $x$. This gives the following commonly-quoted
form of Chang's lemma.

Lemma 2. Let $A \subseteq \{\pm 1\}^n$ such that $|A| = 2^n \alpha$, and let $f = 1_A$ be its characteristic function. Fix
$\rho > 0$ and let $R \subseteq \mathbb{F}_2^n$ be the set $\{ S : |\hat{f}(S)| > \rho \alpha \}$. Then $R$ spans a space of dimension less than
$d = 2 \rho^{-2} \ln(1/\alpha)$.

Proof. If $R$ spans a space of dimension $d$ or greater, there is a set of $d$ linearly independent vectors
in $R$. Completing to form a basis $B$ gives $\sum_{S \in B} \hat{f}(S)^2 > 2 \alpha^2 \ln(1/\alpha)$, violating Lemma 1. □
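For small $n$ the dimension bound of Lemma 2 can be checked by brute force. The sketch below is illustrative only: the function names, the bitmask encoding of $S$, and the parameters `n = 6`, `rho = 0.8` and the sampled set are assumptions, and the rank over $\mathbb{F}_2$ is computed with a simple XOR basis.

```python
import itertools
import math
import random

def fourier_transform(A, n):
    """Map each S (a bitmask over n coordinates) to f_hat(S) for f = 1_A."""
    coeffs = {}
    for S in range(2**n):
        total = 0
        for x in A:
            sign = 1
            for i in range(n):
                if (S >> i) & 1:
                    sign *= x[i]        # chi_S(x) = product of x_i over i in S
            total += sign
        coeffs[S] = total / 2**n
    return coeffs

def gf2_rank(vectors):
    """Rank over F_2 of bitmask vectors, using an incrementally built XOR basis."""
    basis = []
    for v in vectors:
        for b in basis:
            v = min(v, v ^ b)           # clears b's leading bit from v if it is set
        if v:
            basis.append(v)
    return len(basis)

def chang_dimension_check(A, n, rho):
    """Return (dimension spanned by R, the bound d) for the set A and threshold rho."""
    alpha = len(A) / 2**n
    coeffs = fourier_transform(A, n)
    R = [S for S, c in coeffs.items() if abs(c) > rho * alpha]
    return gf2_rank(R), 2 * rho**-2 * math.log(1 / alpha)

n = 6                                   # illustrative parameters (assumptions)
cube = list(itertools.product([1, -1], repeat=n))
A = random.Random(1).sample(cube, 16)
dim, d = chang_dimension_check(A, n, rho=0.8)
print(dim, "<", d)
```

With these parameters $d \approx 4.33$, which is smaller than $n$, so the check is not vacuous.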

For any integer k > 1, there are bases consisting entirely of vectors of Hamming weight k. 
Fixing k and averaging over all such bases gives 

\[ \sum_{S : |S| = k} \hat{f}(S)^2 \le \frac{2}{n} \binom{n}{k} \alpha^2 \ln \frac{1}{\alpha} . \]

This also follows immediately from Shearer's lemma. However, this is noticeably weaker than the 
"weight k bound" 

\[ \sum_{S : |S| = k} \hat{f}(S)^2 = O\left( \alpha^2 \log^k (1/\alpha) \right) . \]
Figure 1: The entropy function $h(p)$ where $p = (1+x)/2$ and $-1 \le x \le 1$, with the upper bounds (2)
(which is tight when $|x|$ is small) and (3) (which is tight when $|x|$ is close to 1).

Finally, we note that if some bits are highly biased, i.e., if $|\hat{f}(i)|/\alpha$ is close to 1, we can replace (2)
with the bound

\[ h(p) \le p (1 - \ln p) , \tag{3} \]

which is tight when p is small. Combining this with the corresponding bound for p close to 1 gives 

\[ h\left( \frac{1+x}{2} \right) \le \frac{1-|x|}{2} \left( 1 - \ln \frac{1-|x|}{2} \right) . \]
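The two upper bounds on the entropy can be compared numerically. This is a small sketch under stated assumptions: the function names `h`, `bound_taylor` and `bound_biased` and the sample values of $x$ are illustrative and do not come from the paper.

```python
import math

def h(p):
    """Entropy function in nats, with h(0) = h(1) = 0 by convention."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def bound_taylor(x):
    """Bound (2): ln 2 - x^2 / 2, evaluated at p = (1 + x) / 2."""
    return math.log(2) - x**2 / 2

def bound_biased(x):
    """Bound (3) applied to q = (1 - |x|) / 2, the smaller of p and 1 - p."""
    q = (1 - abs(x)) / 2
    return q * (1 - math.log(q)) if q > 0 else 0.0

# Sample values of x (assumption): unbiased, moderately biased, highly biased.
for x in [0.0, 0.5, 0.9, 0.99]:
    p = (1 + x) / 2
    print(x, h(p), bound_taylor(x), bound_biased(x))
```

For $x$ near 0 the quadratic bound is the closer of the two, while for $|x|$ near 1 the second bound is much closer to $h$.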

We compare this bound with (2) in Figure 1. This gives another version of Lemma 1: 
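Lemma 3. Let $A \subseteq \{\pm 1\}^n$, let $f = 1_A$ be its characteristic function, and let

\[ \delta_i = \frac{1}{2} \left( 1 - \frac{|\hat{f}(i)|}{\alpha} \right) = \min(p_i^+, 1 - p_i^+) . \]

Then

\[ \sum_{i=1}^n \delta_i (1 - \ln \delta_i) \ge \ln |A| . \tag{4} \]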

This is nearly tight, for instance, if $A$ is the set of vectors with Hamming weight 1. Then $|A| = n$,
$\delta_i = 1/n$, and (4) reads $1 + \ln n \ge \ln n$.
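The Hamming-weight-1 example can be evaluated directly. In this sketch the name `lemma3_lhs` and the choice `n = 10` are illustrative, and we assume that a weight-1 vector in $\{\pm 1\}^n$ has exactly one coordinate equal to $-1$; the opposite convention gives the same $\delta_i$.

```python
import math

def lemma3_lhs(A, n):
    """Sum over i of delta_i * (1 - ln delta_i), with delta_i = min(p_i^+, 1 - p_i^+)."""
    total = 0.0
    for i in range(n):
        p_plus = sum(1 for x in A if x[i] == 1) / len(A)
        delta = min(p_plus, 1 - p_plus)
        if delta > 0:                   # the term tends to 0 as delta tends to 0
            total += delta * (1 - math.log(delta))
    return total

n = 10                                  # illustrative dimension (assumption)
# Assumed convention: a weight-1 vector has a single coordinate equal to -1.
A = [tuple(-1 if j == i else 1 for j in range(n)) for i in range(n)]

lhs = lemma3_lhs(A, n)
print(lhs, ">=", math.log(len(A)))      # compare with 1 + ln n on the left
```

The left-hand side agrees with $1 + \ln n$ up to floating-point error, so the gap in (4) is exactly 1 for this family.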